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Device and Method for Generating and Executing Compressed Programs of 

a Very-Long-Instruction-Word Processor 

Descr i ption 

CROSS REFERENCE TO RELATED APPLICATIONS 

[00011 This application claims the benefit of German Patent Application 
No. 198 59 389.9 filed December 22. 1998 and International Patent 
Application No. PCT/DE/0405 0 filed December 21. 1999. both of which 
applications are hereb y incorporated by reference in their entireties herein. 

FIELD OF THE INVENTION 

fnn02i The invention relates to a method for controlling functional units in a 
processor, according to which, in a configuration, a sequence of primary 
instruction words consisting of multiple instruction word parts and originating 
from a translation of a program code is compressed and stored as a sequence of 
related program words. In a subsequent execution phase, sequential secondary 
instruction words consisting of a plurality of instruction word parts and having the 
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full instruction word width needed to control all functional units are generated 
from the sequence of program words. 

[0003] The invention also relates to a processor arrangement for carrying out 
the method having functional units, an instruction word memory associated with 
these functional units and an instruction word buffer for storing instruction words 
that have already been generated and have a width that is at least the size of the 
bit width of the secondary instruction word. 

BACKGROUND OF THE INVENTION 

[0flQ4] Processor arrangements of the aforementioned type contain functional 
units that operate in parallel to one another and that are controlled at every clock 
cycle by an instruction word. The particular instruction word is extracted from a 
program word that is taken from a program memory. 

[00051 For their part, the instruction words consist of a plurality of instruction 
word parts, where each individual instruction word part serves to control one 
functional unit. 

[00061 To improve the performance of processor arrangements, the goal is an 
increase in the processing width, which makes it necessary to increase the 
number of functional units. In general, this increases the bit width of the 
instruction words and thus also of the program words. The consequence of this 
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is the provision of corresponding storage space in the program memory, which 
occupies the majority of the area on the semiconductor chip. 

[00071 Since the size of the program words determines the size of the 
program memory, the goal is to reduce the size of the program words in order to 
reduce the need for memory space. A number of compression methods for this 
purpose are known. The most obvious method is described in the report by H. 
Weiss and G. Fettweis, [in English:] "Dynamic Codewidth Reduction for VLIW 
Instruction Set Architectures in Digital Signal Processors" (Proceedings of the 
3rd International Workshop on Signal and Image Processing IWSIP '96, pages 
517 to 520). 

[00081 In this method, the program words are assembled from sequential 
primary instruction words in such a way that secondary instruction words can be 
subsequently reproduced therefrom in that a secondary instruction word (VLIW), 
once it has been created, is written to an instruction word memory, and, in order 
to produce the next secondary instruction word, only those instruction word parts 
in the stored secondary instruction word are exchanged which differ between the 
stored secondary instruction word and the secondary instruction word to be 
generated. Consequently, the program word need only contain the information 
specifying which instruction word part differs and with what content it differs. 

[0009] It is thus possible to design the program words to be very narrow and 
thus save memory space. 
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foomi However, when there are great differences between the stored 
secondary instruction word and the secondary instruction word to be created, the 
width of the program word must be increased if these relatively great differences 
occur frequently, which entails the disadvantage of a relatively large memory 
space, or else the differences must be distributed over multiple program words. 
Thus, the secondary instruction word must be created from multiple program 
words over multiple clock cycles. This results in the disadvantage that it requires 
a relatively long time. 

[00111 Consideration is now being given to ways of overcoming this 
disadvanatge. In particular, attention is directed to specific ways of 
increasing the operating speed in an application while retaining a small 
program word widths. 

SUMMARY OF THE INVENTION 

[0Q12] H e nc e , th e obj e ct of th e inv e nt i on is to i ncr e as e op e rat i ng sp ee d i n an 
app l icat i on spocific mannorSystems and method for increasing operating 
speed in an application are provided while retaining a small program word 
width. 

[00131 As regards the method, the object is attained in that, as the result of 
the configuration a program word contains a first characteristic of a primary 
instruction word from a first group of preceding primary instruction words which 
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has the greatest degree of correspondence with the primary instruction word 
associated with the program word, and contains instruction word parts which 
differentiate the primary instruction word belonging to the program word from the 
primary instruction word belonging to the characteristic. In the execution phase, 
a second group — corresponding in number to the first group — of secondary 
instruction words, each of which is equipped with a second characteristic, is 
stored. By means of the first characteristic contained in the program word, a 
secondary instruction word corresponding to the associated primary instruction 
word is ascertained from the second group via the associated second 
characteristic, and the secondary instruction word corresponding to the program 
word is generated in that the instruction word parts contained in the program 
word are exchanged in the secondary instruction word from the secondary 
group. 

[00141 As early as the configuration phase, the program words can be 
constructed such that they contain only the minimum possible information for 
later generation of a secondary instruction word in the execution phase. This is 
achieved by referring back to the preceding primary instruction words that have 
the greatest correspondence with the primary instruction word to be configured. 
Since the secondary instruction words are to be generated during the execution 
phase with the same width and in the same sequence as the primary instruction 
words, the execution phase proceeds essentially in the reverse order to the 
configuration phase, and the already generated secondary instruction words 
corresponding to the primary instruction word most similar in the configuration 
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phase to a current primary instruction word are used one at a time to generate a 
new secondary instruction word. Since the instruction word with the greatest 
similarity or the greatest correspondence is always referred back to, the amount 
of information necessary to generate a new secondary instruction word can be 
kept as small as possible. In this way, it is possible to minimize the storage 
requirement of a program memory. 

[00151 In one embodiment of the method, provision is made for the first group 
to consist of a first number of primary instruction words directly preceding the 
primary instruction word in question. The second group consists of a second 
number of secondary instruction words that is at least equal to the first number, 
where, prior to the generation of the next sequential secondary instruction word, 
the most recent secondary instruction word is appended to the group as the last 
word, and the first secondary instruction word to have been added that is in 
excess of the second number is removed from the second group. The group 
thus always consists of the immediately preceding instruction words, one of 
which has a greatest possible similarity when the instruction words within a group 
differ in content. 

[0016] Another possible embodiment of the method is for the newly 
generated secondary instruction word to be appended to the second group in 
that this word is stored in place of the secondary instruction word that was used 
to generate it. 
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[001 71 This variant of the process precludes the possibility of the group filling 
up with instruction words having the same content when the same function is 
passed through multiple times. 

[00181 Another possibility is that the newly generated secondary instruction 
word is not stored. In this case, the same set of previously generated instruction 
words is always accessed each time a secondary instruction word is generated, 
which is advantageous when these stored instruction words are suitable as 
pattern words for the generation of other secondary instruction words, with the 
result that storage processes can be avoided by this means. 

[00191 In one embodiment of the method, provision is made for the first 
characteristic to be formed as a minimum code distance between the primary 
instruction word belonging to the program word in question and the primary 
instruction word with the greatest similarity. 

[00201 Another embodiment of the method provides for the second 
characteristic to consist of an address corresponding to the first characteristic 
that is the address of a preceding secondary instruction word in a memory used 
for storage of the second group. 

[00211 Both of these embodiments exhibit an especially simple generation of 
features. 

100221 A reduction in the program word width, and thus an associated 
reduction in the width of the program memory, can be achieved in that the 
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program word consists of a number of instruction word parts that corresponds to 
the number of instruction word parts to be differentiated which occurs most 
frequently within the configuration, and in that a plurality of program words are 
used to assemble secondary instruction words that require more than the 
number of instruction words stored in one program word for the secondary 
instruction word used for generation. Consequently, the width of the program 
memory satisfies the most common application cases, and no delay arises in the 
generation of secondary instruction words. Only in the relatively few cases 
where the number of instruction word parts to be changed exceeds the width of a 
program word are two or more program words required in two or more clock 
cycles to produce the secondary instruction word. 

[00231 A further reduction in program word width can be achieved in that the 
instruction word parts are compressed in one program word. This is 
accomplished in that the bit width is reduced to the extent that it is possible to 
represent the most frequently occurring instruction word parts, and in that 
multiple program words are used when instruction word parts occur that require a 
greater bit width in order to be represented. 

[00241 For this type of compression of the program word, it is especially 
useful for the width of the instruction word parts in the program word to be halved 
and for one or two program words to be provided for representation of the 
instruction word parts. For example, if the program word normally has a width of 
8 bits, 256 combinations are available. However, a large number of these 
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combinations is required only extremely rarely or not at all. Accordingly, the 
program word can be provided with a width of 4 bits, which is sufficient to 
represent the 16 most frequently occurring combinations. If a combination other 
than these should need to be represented, two or more program words are used 
for this purpose. However, since this occurs only very rarely, the savings in 
storage space for the program memory that is achieved is greater than the 
possible expenditure of time for generation of rare combinations. 

[0025] The object is also achieved in accordance with the invention by a 
processor arrangement that is characterized in that the instruction word buffer 
consists of a memory with selective line-by-line access. In contrast to the prior 
art shown, this affords the possibility of directly accessing various stored 
instruction words in order to produce a new secondary instruction word. Thus, it 
is not necessary to use just any stored instruction word for generating a new 
secondary instruction word, but rather the particular stored instruction word can 
be used that has the greatest similarity to the secondary instruction word to be 
generated. Hence, the resource requirements for changes are relatively small, 
requiring only a small bit width for the program word, and thus a small program 
memory. 

[0026] The invention is described in detail below with reference to an example 
embodiment. Th e app e rta i ning draw i ngs show: 
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BRIEF DESCRIPTION OF THE DRAWINGS 



[00271 Further features of the invention, its nature, and various 
advantages will be more apparent from the following detailed description 
and the accompanying drawings, wherein like reference characters 
represent like elements throughout, and in which: 

f0028i Fig. 1 is = a flowchart of th e i nv e nt i v e m e thod illustrating processor 
and program operation in accordance with in accordance with the priciples 
of the present invention and 

[0029] Fig. 2 th e princip le of th e i nv e ntiv e g e n e rat i on is schematic 
illustration of structure of the primary and secondary instruction words = 
generated in accordance with the priciples of the present invention 

100301 The following is a list of the reference numerals an d labels used 
in Fias. 1 and 2 

List of Reference Numerals and Labels 



1 


functional unit 


2 


processor 


1 


program code 


4 


translation 




primary instruction word 


I 


instruction word part 


program word 
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S program memory 

3 secondary instruction word 

lfl first characteristic 

U first group 

12 second group 

H second characteristic 

14 memory 



DETAILED DESCRIPTION OF THE INVENTION 

10031 1 in4h eWith reference to Figs. 1 and 2. in the operational method- 
r e pr e s e nt e d i n F i g. 1 for control of functional units 1 in a processor 2, a 
sequence of primary instruction words 5 is generated from a plurality of 
instruction word parts 6 from a program code 3 by means of a translation 4 in a 
configuration phase. This sequence of primary instruction words 5 is 
compressed and stored in a program memory 8 as a sequence of related 
program words 7. 

[00321 The program words 7 consist of a number of instruction word parts 6 
corresponding to the most frequently occurring number of instruction words 6 to 
be differentiated within the configuration. In order to compose secondary 
instruction words 9, more than the number of instruction word parts 6 stored in 
one program word 7 may be needed. Then a plurality of program words 7 are 
used to do so. 
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[00331 In a subsequent execution phase, sequential secondary instruction 
words 9 consisting of a plurality of instruction word parts 6 and having the full 
instruction word width needed to control all functional units 1 are generated from 
the sequence of program words 7. 

[0034] In the result of the configuration, a program word 7 has a first 
character-istic 10 of a primary instruction word 5 from a first group 1 1 of 
preceding primary instruction words 5, which has the greatest similarity to the 
primary instruction word 5 associated with the program word 7, and contains 
instruction word parts 6 that differentiate the primary instruction word 5 belonging 
to the program word 7 from the primary instruction word 5 belonging to the 
characteristic. The first characteristic 10 is formed as a minimum code distance 
between the primary instruction word 5 belonging to the relevant program word 7 
and the primary instruction word 5 with the greatest similarity. 

[00351 In the execution phase, a second group 12 of secondary instruction 
words 9 — corresponding in number to the first group 1 1 — each of which is 
provided with a second characteristic 13, is stored. The second characteristic 13 
is formed from a line number of a memory 14 that serves to store the second 
group 12. 

[0036] By means of the first characteristic 10 contained in the program word 
7, a secondary instruction word 9 corresponding to the associated primary 
instruction word 5 is ascertained from the second group 12 via the associated 
second characteristic 13. The secondary instruction word 9 corresponding to the 
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program word is generated in that the instruction word parts 6 contained in the 
program word 7 are exchanged in the secondary instruction word 9 from the 
second group 12. 

The first group 1 1 consists of a first number of the primary instruction words 5 
preceding the primary instruction word 5 in question. The second group 12 
consists of a second number of secondary instruction words 9, which is at least 
equal to the first number, where, prior to the generation of the next sequential 
secondary instruction word 9, each most recent secondary instruction word 9 can 

be appended to the second group 12 as the last word. The first secondary 
instruction word 9 to have been added that is in excess of the second number is 

removed from the second group 12. 
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Method for controll i ng functiona l un i ts i n a processor and processor 
arrangement for carry i ng out th e m e thod 

L i st of Symbo l s 

4 funct i ona l unit 

2 proc e ssor 

3 program cod e 

4 translation 

§ pr i mary i nstruct i on word 

& i nstruct i on word part 

1 program word 

% program m e mory 

9 s e condary i nstruction word 

40 f i rst charact e rist i c 

44 f i rst group 

42 s e cond group 

43 s e cond charact e r i st i c 

44 m e mory 
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Claims 



1 . Method for controlling functional units in a processor, according to which, 
in a configuration, a sequence of primary instruction words consisting of 
multiple instruction word parts and originating from a translation of a 
program code is compressed and stored as a sequence of related 
program words, and according to which, in a subsequent execution 
phase, sequential secondary instruction words consisting of a plurality of 
instruction word parts and having the full instruction word width needed to 
control all functional units are generated from the sequence of program 
words characterized in that, 

in the result of the configuration, a program word (7) has a first 
characteristic (10) of a primary instruction word (5) from a first group (1 1 ) 
of preceding primary instruction words (5), which has the greatest 
similarity to the primary instruction word (5) associated with the program 
word (7), and contains instruction word parts (6) that differentiate the 
primary instruction word (5) belonging to the program word (7) from the 
primary instruction word (5) belonging to the first characteristic (10) 

and in that, in the execution phase, a second group (12) of secondary 
instruction words (9) — corresponding in number to the first group (11) — 
each of which is provided with a second characteristic (13), is stored, and 
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in that, by means of the first characteristic (10) contained in the program 
word (7), a secondary instruction word (9) corresponding to the associated 
primary instruction word (5) is ascertained from the second group (12) via 
the associated second characteristic (13), and the secondary instruction 
word (9) corresponding to the program word (7) is generated in that the 
instruction word parts (6) contained in the program word (7) are 
exchanged in the secondary instruction word (9) from the second group 
(12). 

2. Method in accordance with claim 1 , characterized in that the first group 
(1 1 ) consists of a first number of primary instruction words (5) that directly 
precede the primary instruction word (5) in question and 

in that the second group (12) consists of a second number of secondary 
instruction words (9) that is at least equal to the first number, where, prior 
to the generation of the next sequential secondary instruction word (9), 
each most recent secondary instruction word (9) is appended to the 
second group (12) as the last word, and the first secondary instruction 
word (9) to have been added and that is in excess of the second number 
is removed from the second group (12). 

3. Method in accordance with claim 1 , characterized in that the newly 
generated secondary instruction word (9) is appended to the second 
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group (12) in that the former is stored in place of the secondary instruction 
word (9) that was used for its generation. 

4. Method in accordance with claim 1 , characterized in that the newly 
generated secondary instruction word (9) is not stored. 

5. Method in accordance with one of claims 1 through 4, characterized in 
that the first characteristic (10) is formed as a minimum code distance 
between the primary instruction word belonging to the program word in 
question and the primary instruction word with the greatest similarity. 

6. Method in accordance with one of claims 1 through 5, characterized in 
that the second characteristic (13) consists of an address corresponding 
to the first characteristic that is the address of a preceding secondary 
instruction word in a memory (14) used for storage of the second group 
(12). 

7. Method in accordance with one of claims 1 through 6, characterized in 
that the program word (7) consists of a number of instruction word parts 
(6) that corresponds to the number of instruction word parts (6) to be 
differentiated that occurs most frequently within the configuration, and in 
that a plurality of program words (7) are used to assemble secondary 
instruction words (9) that require more than the number of instruction 
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words (9) stored in one program word (7) for the secondary instruction 
word (9) used for generation. 

8. Method in accordance with one of claims 1 through 7, characterized in 
that the instruction word parts (6) are compressed in one program word 
(7) by reducing the bit width to the extent that it is possible to represent 
the most frequently occurring instruction word parts (6), and in that 
multiple program words (7) are used when instruction word parts (6) occur 
that require a greater bit width in order to be represented. 

9. Method in accordance with claim 8, characterized in that the width of the 
instruction word parts (6) in the program word (7) is halved, and one or 
two program words (7) are provided for representation of the instruction 
word parts (6). 

10. Processor arrangement for carrying out the method, having functional 
units, an instruction word memory associated with these functional units 
and an instruction word buffer for storing instruction words that have 
already been generated and have a width that is at least the size of the bit 
width of the secondary instruction word, characterized in that the 
instruction word buffer consists of a memory (14) with selective line-by- 
line access. 
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ABSTRACT 



A method for controlling functional units in a processor- 
according to which in a configuration a sequence of primary instruction 
words which consists of several instruction word parts and originates from 
a translation of a program code is compressed and stored as a sequence 
of associated program words. The invention also relates to a processor 
system for carrying out this method. The aim of the invention is to increase 
operating speed in an application-specific manner while retaining a low 
program word width. To this end, as regards the method, a program word 
contains a first characteristic of a primary instruction word and instruction 
word parts which differentiate the primary instruction word belonging to 
the program word from the primary instruction word belonging to the 
characteristic. By means of the first characteristic contained in the 
program word a secondary instruction word is generated by exchanging 
the instruction word parts contained in the program word with those in a 
preceding secondary instruction word. On the system side the aim of the 
invention is solved by providing for the instruction word buffer to consists 
of a memory with optional line-by-line access. 
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